DISTRIBUTED SELFISH LOAD BALANCING 



PETRA BERENBRINKt, TOM FRIEDETZKY*, LESLIE ANN GOLDBERG 5 , 

PAUL W. GOLDBERG 8 , ZENGJIAN HU 1 ", AND RUSSELL MARTIN 5 

Abstract. Suppose that a set of m tasks are to be shared as equally as possible amongst a set of n resources. A game- 
theoretic mechanism to find a suitable allocation is to associate each task with a "selfish agent", and require each agent to 
select a resource, with the cost of a resource being the number of agents to select it. Agents would then be expected to migrate 
from overloaded to underloaded resources, until the allocation becomes balanced. 

Recent work has studied the question of how this can take place within a distributed setting in which agents migrate selfishly 
without any centralized control. In this paper we discuss a natural protocol for the agents which combines the following desirable 
features: It can be implemented in a strongly distributed setting, uses no central control, and has good convergence properties. 
For m S> n, the system becomes approximately balanced (an e-Nash equilibrium) in expected time 0(log log m). We show 
using a martingale technique that the process converges to a perfectly balanced allocation in expected time O (log log m + n 4 ). 
We also give a lower bound of £7(max{log log m, n}) for the convergence time. 

1. Introduction. Suppose that a consumer learns the price she would be charged by some domestic 
power supplier other than the one she is currently using. It is plausible that if the alternative price is lower 
than the price she is currently paying, then there is some possibility that she will switch to the new power 
supplier. Furthermore, she is more likely to switch if the ratio of current price to new price is large. If there is 
only a small saving, then it becomes unattractive to make the switch, since an influx of new business (oneself 
and other consumers) may drive up the price of the new power supplier and make it no longer competitive. 

We study a simple mathematical model of the above natural rule, in the context of a load balancing 
(or task allocation) scenario that has received a lot of recent attention. We assume the presence of many 
individual users who may assign their tasks to chosen resources. The users are selfish in the sense that 
they attempt to optimize their own situation, i.e., try to assign their tasks to minimally loaded resources, 
without trying to optimize the global situation. In general, a Nash equilibrium among a set of selfish users 
is a state in which no user has the incentive to change her current decision. In our setting, this corresponds 
to no user having an incentive to reallocate their task to some other resource. An e-Nash equilibrium is a 
standard notion of an approximate Nash equilibrium, and is a state in which no user can reduce her cost by 
a multiplicative factor of less than 1 — e by changing action. Here we do not focus on the quality of equilibria, 
but rather on the (perhaps more algorithmic) question of convergence time to such a state. 

We assume a strongly distributed and concurrent setting, i.e., there is no centralized control mechanism 
whatsoever, and all users may choose to reallocate their tasks at the same time. Thus, we do not (and cannot) 
use the Elementary Step System [26] (discussed in more detail in the next section), where the assumption is 
that at most one user may reallocate her task at any given stage. 

Throughout we let m denote the number of tasks (in the above discussion, customers) and n the number 
of resources (power suppliers). As hinted in the above discussion, we assume that typically m 3> n. In a 
single time step (or round) each task does the following. Let i be the resource currently being used by the 
task. Select j uniformly at random from {1, . . . ,n} and find the load of resource j. Let Xi and Xj be the 
loads of resources i and j respectively. If Xj < Xi, migrate from i to j with a probability of 1 — Xj/Xi\ the 
transition from round t to round t + 1 is given in Figure fTTTl Notice that if we had unconditional migrations, 
i.e., without an additional coin flip (move only with probability 1 — Xj(t)/Xi(t)), then this may lead to 
an unstable system; consider for example the case m = 2 with initially most tasks assigned to one of the 
resources: the overload would oscillate between the two resources, with a load ratio tending towards 2:1. 
(This observation about the risk of oscillation has also been made in similar contexts in [331 [12], and we will 
not elaborate on it further.) 

It can easily be seen that, if all tasks use the above policy, then the expected load of every resource at 



'Preliminary version in proceedings of ACM-SIAM Symposium on Discrete Algorithms (SODA) 2006. This work was 
partially supported by the EPSRC grants "Discontinuous Behaviour in the Complexity of Randomized Algorithms" and "Al- 
gorithmics of Network-sharing Games" , and by the Natural Sciences and Engineering Research Council of Canada (NSERC) 
discovery grant 250284-2002. 

T School of Computing Science, Simon Fraser University, Canada 

tDepartment of Computer Science, University of Durham, U.K. 

§ Department of Computer Science, University of Liverpool, U.K. 



For each task b do in parallel 

Let ijj be the current resource of task b 
Choose resource jb uniformly at random 
Let Xi b (t) be the current load of resource i 
Let Xj b (t) be the current load of resource j 
IfX ib (t) >X jb (t) then 

Move task b from resource it to jt with probability 1 — Xj b (t) / X ib (t) 



Fig. 1.1. The protocol with "neutral moves" allowed. 

the next step is m/n: 

Observation 1.1. Regardless of the load distribution at time step t, the expected load of every resource 
at the next step is m/n. 

Proof. To see this, assume that the loads Xi{t) are arranged in descending order so that Xj(t) > Xj +1 {t) 
and note that 
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This provides a compelling motivation for the policy, which is that as a result, no task has an incentive 
to deviate unilaterally from this policy. This implies that in the terminology of [5] it is a Nash rerouting 
policy. It is also a simple regret-minimizing policy in the sense of [5] since the average cost of resources used 
by an agent is no higher than the best choice of a single resource to be used repeatedly. Although the above 
rule is very natural and has the nice properties described above, we show that it may take a long time to 
converge to a perfectly balanced allocation of tasks to resources. We address this problem as follows. Define 
a neutral move to be a task migration from a resource with load £ at time t to a resource with load £ — 1 
at time t (so, if no other task migrates, then the cost to the migrating task is unchanged.) We consider 
a modification in which neutral moves are specifically disallowed (see Figure [2TT]) . That seemingly-minor 
change ensures fast convergence from an almost balanced state to a perfectly-balanced state. To summarize, 
here are the most important features of the modified protocol: 

• We do not need any global information whatsoever (apart from the number of available resources); 
in particular, a task does not need to know the total number of tasks in the system. Also, it is 
strongly distributed and concurrent. If additional tasks were to enter the system, it would rapidly 
converge once again, with no outside intervention. 

• A migrating task needs to query the load of only one other resource (thus, doing a constant amount 
of work in each round). 

• When a task finds a resource with a significantly smaller load (that is, a load that is smaller by at 
least two), the migration policy is exactly the same as that used by the Nash rerouting policy of 
Figure II. li so the incentive is to use that probability. 

• When a task finds a resource with a load that is smaller by exactly one unit, the migration policy 
is sufficiently close to the Nash rerouting policy that the difference in expected load is at most one, 
and there is little incentive to deviate. 

• The protocol is simple (as well as provably efficient) enough to convince users to actually stick to it. 
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1.1. Related Work. We are studying a simple kind of congestion game. In their general form, conges- 
tion games specify a set of agents, and a set of resources, and for each agent, a set of allowed strategies, where 
a strategy is the selection of a subset of the resources (in this paper, any singleton subset is allowed). The 
cost of a resource is a non-decreasing function of the number of agents using it, and the cost for an agent is 
the sum of the costs of resources it uses. A classical result due to Rosenthal [27] is that pure Nash equilibria 
(NE) always exist for congestion games, and this is shown by exhibiting a potential function; they are a type 
of potential game [25] . The potential function also establishes that pure NE can be found via sequences of 
"better-response" moves, in which agents repeatedly switch to lower-cost strategies. The potential function 
we use later in this paper is the one of [27] . modulo a linear re-scaling. 

These results do not show how to find Nash equilibrium efficiently, the problem being that in the worst 
case, sequences of these self-improving moves may be exponentially-long. The following questions arise: 
when can NE be found by any efficient algorithm, and if so, whether it can be found via an algorithm that 
purports to be a realistic model of agents' behavior. Regarding the first of these questions, the answer is 
no in the general setting (the problem is PLS-complete for general congestion games [TU], see also [TJ 
PLS-completeness (introduced in [18] ) is a generally-accepted criterion for intractability of computational 
problems in which we seek a local optimum of a given objective function. 

However, due to the basic fact of [27] [55] that pure NE are sure to result from a sufficiently long better- 
response sequence, many algorithms for finding them are based on such sequences. An important sub-class is 
the Elementary step system (ESS), proposed in Orda et al. [26] . which consists of best-response moves (where 
a migrating agent switches not to any improved choice, but to one that is optimal at the time of migration). 
For matroid games (a class of congestion games that includes the ones we consider here), Ackermann et al. [1] 
show that best-response sequences must have length polynomial in the number of players, resources, and 
maximal rank of the matroids. In this paper we consider the special case of singleton congestion games (where 
players' strategies are always single resources, thus the ranks of the matroids is 1). For these games, Ieong et 
al. [17j give polynomial bounds for best-response and better-response sequences. Chien and Sinclair |3J study 
a version of the ESS in the context of approximate Nash equilibria, and show that in some cases the e-Nash 
dynamics may find an e-NE where finding an exact NE is PLS-complete. Mirrokni and Vetta [23 study the 
convergence rate of the ESS to solutions, and the quality of the approximation after limited iterations. 

While best- and better-response dynamics are a plausible model of selfish behaviour, the associated 
algorithms typically require that migrations be done one-by-one, and another common assumption is that 
best (not better) responses are always selected. This means that to some extent, agents are being assumed 
to be governed by a centralized algorithm that finds a NE, and raises the question of what sort of distributed 
algorithms can do so, especially if agents have limited information about the state of the system (and so may 
not be able to find best responses). That issue is of central importance to us in this paper. Goldberg [15] 
studied situations where simple better-response approaches can be realised as weakly distributed algorithms 
(where each agent looks for moves independently of the others, but it is assumed that moves take place 
consecutively, not simultaneously). In a strongly distributed setting (as we study here), where moves may 
occur simultaneously, we need to address the possibility that a change of strategy may increase an agent's 
cost. It may happen that after a best response has been identified, it is not optimal at the time it is 
executed. Even-Dar and Mansour [9] consider concurrent, independent rerouting decisions where tasks are 
allowed to migrate from overloaded to underloaded resources. Their rerouting process terminates in expected 
0(loglogm + logn) rounds when the system reaches a Nash equilibrium. Note that their convergence rate 
as a function of the number n of resources is faster than the one we obtain in this paper. The reason is 
that is requires agents to have a certain amount of global knowledge. A task is required to know whether its 
resource is overloaded (having above-average load) and tasks on underloaded resources do not migrate at all. 
Our rerouting policy does not require that agents know anything other that their current resource load, and 
the load of a randomly-chosen alternative. Even-Dar and Mansour also present a general framework that 
can be used to show a logarithmic convergence rate for a wide class of rerouting strategies. Our protocol 
does not fall into that class, since we do not require migrations to occur only from overloaded resources. 
Note that our lower bound is linear in n (thus, more than logarithmic). 

Distributed algorithms have been studied in the Wardrop setting (the limit of infinitely many agents) , for 
which recent work has also extensively studied the coordination ratio (29[|28]. Fischer et al. [12] investigate 
convergence to Wardrop equilibria for games where agents select paths through a shared network to route 
their traffic. (Singleton games correspond to a network of parallel links.) Their re-routing strategies are 
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slightly different to ours — they assume that in each round, an agent queries a path with probability 
proportional to the traffic on that path. Here we assume paths (individual elements of a set of parallel 
links) are queried uniformly at random, so that agents can be assumed to have minimal knowledge. As in 
this paper, the probability of switching to a better path depends on the latency difference, and care has to 
be taken to avoid oscillation. Also in the Wardrop setting, Blum et al. [2] show that approximate NE is 
the outcome of regret-minimizing rerouting strategies, in which an agent's cost, averaged over time, should 
approximate the cost of the best individual link available to that agent. 

Certain generalisations of singleton games have also been considered. These generalisations are not 
strictly congestion games according to the standard definition we gave above, but many ideas carry over. One 
version introduced by Koutsoupias and Papadimitriou [19] has been studied extensively in different contexts 
(for example [UJ [7J Q31 US]). In this generalisation, each task may have a numerical weight (sometimes 
called traffic, or demand) and each resource has a speed (or capacity). The cost of using a resource is the 
total weight of tasks using it, divided by its speed. Even-Dar et al. [5] give a generalized version of the 
potential function of [57J that applies to these games, and which was subsequently used in [15]. For these 
games however, it seems harder to find polynomial-length best-response sequences. Feldman et al. show 
how a sequence of steps may lead to NE, under the weaker condition that the maximal cost experienced 
by agents must not increase, but individual steps need not necessarily be "selfish". They also note that 
poorly-chosen better-response moves may lead to an exponential convergence rate. Another generalisation 
of singleton games is player- specific cost functions [22], which allow different agents to have different cost 
functions for the same resource. In this setting there is no potential function and better-response dynamics 
may cycle, although it remains the case that pure NE always exist. 

Our rerouting strategy is also related to reallocation processes for balls into bins games. The goal of a 
balls into bins game is to allocate m balls as evenly as possible into n bins. It is well-known that a fairly 
even distribution can be achieved if every ball is allowed to randomly choose d bins and then the ball is 
allocated to the least loaded amongst the chosen bins (see [53] for an overview). Czumaj et al. [5] consider 
such an allocation where each ball initially chooses two bins. They show that, in a polynomial number of 
steps, the reallocation process ends up in a state with maximum load at most \mjn\ + 1. Sanders et al. |30] 
show that a maximum load of \m/n\ + 1 is optimal if every ball is restricted to two random choices. 

In conclusion, this paper sits at one end of a spectrum in which we study a very simple load-balancing 
game, but we seek solutions in a very adverse setting in which agents have, at any point in time, a minimal 
amount of information about the state of their environment, and carry out actions simultaneously in a 
strongly distributed sense. 

1.2. Overview of our results. Section [3] deals with upper bounds on convergence time. The main 
result, Theorem 13. 1[ is that the protocol of Figure [2~T1 converges to a Nash equilibrium within expected time 
(9(log log m + n 4 ). 

The proof of Theorem 13.11 shows that the system becomes approximately balanced very rapidly. Specifi- 
cally, Corollary 13 . 1 1 1 shows that if n < m 1 / 3 , then for all e, either version of the distributed protocol (with or 
without neutral moves allowed) attains an e-Nash equilibrium (where all load ratios are within [1 — e, 1 + e]; 
we use e to denote a multiplicative factor as in [3]) in expected O(loglogm) rounds. The rest of Section [3] 
analyses the protocol of Figure |2~T1 It is shown that within an additional 0(n 4 ) rounds the system becomes 
optimally balanced. 

In Section |4j we provide two lower bound results. The first one, Theorem 14. 1[ shows that the first 
protocol (of Figure [TTTl including moves that do not necessarily yield a strict improvement for an individual 
task but allow for simply "neutral" moves as well, results in exponential (in n) expected convergence time. 
Finally, in Theorem 14. 21 we provide a general lower bound (regardless of which of the two protocols is being 
used) on the expected convergence time of f2(loglogm). This lower bound matches the upper bound as a 
function of m. 

2. Notation. There are m tasks and n resources. An assignment of tasks to resources is represented 
as a vector (xi, . . . ,x n ) in which Xi denotes the number of tasks that are assigned to resource i. In the 
remainder of this paper, [n] denotes {1, . . . , n}. The assignment is a Nash equilibrium if for all i G [n] and 
j £ [n], \xi — Xj\ < 1. We study a distributed process for constructing a Nash equilibrium. The states 
of the process, X(0),X(1), . . ., are assignments. The transition from state X(t) — (X±(t), . . . , X n (t)) to 
state X(t+ 1) is given by the greedy distributed protocol in Figure |2~T1 
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For each task b do in parallel 

Let ijj be the current resource of task b 
Choose resource jb uniformly at random 
Let Xi b (t) be the current load of resource i 
Let Xj b (t) be the current load of resource j 
If X ib (t) > X jb (t) + 1 then 

Move task b from resource it to jt with probability 1 — Xj b (t) / X ib (t) 



Fig. 2.1. The modified protocol, with "neutral moves" disallowed. 



Note that if X(t) is a Nash equilibrium, then X(t + 1) = X(t) so the assignment stops changing. 
Here is a formal description of the transition from a state X{t) = x. Independently, for every i G [n], 
let (Yi t i(x), . . . ,Yi t n(x)) be a random variable drawn from a multinomial distribution with the constraint 
Ej=i Yi.j(x) = X{. (Yij represents the number of migrations from i to j in a round.) The corresponding 
probabilities (pi t i(x), . . . ,Pi. n {x)) are given by 

( f) i£x i >x j + l, 

Pi,j( x ) = S if i ^ j but Xl < Xj +1, 

ThenX^ + l) = E"=i 

For any assignment x — (x\, . . . , x n ), let x — — EiLi ^i' We define the potential function <E>(x) — 

e?=i - ^) • N ° te that $ o) = Er= 

^ xf — nx 2 , and that a single selfish move reduces the potential. 
3. Upper bound on convergence time. Our main result is the following. 

Theorem 3.1. Let T be the number of rounds taken by the protocol of Figure \2.1\ to reach a Nash 
equilibrium for the first time. Then E[T] = 0(loglogm + n 4 ). 

The proof of this theorem proceeds as follows. First fLemma !3.6p we give an upper bound on E[<f>(X(t))] 
which implies (Corollarv l3.10[) that there is a r = O(loglogm) such that, with high probability, $(X(t)) = 
0(n). We also show (Observation 13.51 and Corollary 13. 14j) that $>(X(t)) is a super-martingale and (Lemma 
I3.15P that it has enough variance. Using these facts, we obtain the upper bound on the convergence time. 
Definition: Let Si(x) = {j | Xj < Xi — 1}. Si(x) is the set of resources that are significantly smaller than 
resource i in state x (in the sense that their loads are at least two tasks smaller than the load of resource i). 
Similarly, let L t {x) = {j \ Xj > x t + 1} and let d t {x) = i J2 r .\x -x \<i( x 



Observation 3.2. E[Xi{t + 1) | X(t) = x] =x + di(x). 
Proof. 
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Observation 3.3. £™ =1 (E[X t {t + 1) | X(t) = x]f = nx 2 + £* =1 <U(x) 2 . 
Proof. Using Observation 13. 2[ 

n n n n 

£(E[JCi(t+l) | X(t) = x]) 2 = ^2(x + di(x)) 2 = nx 2 + 2x^2di{x) +^2di{x) 2 , 

i—l i— 1 i—l i— 1 

and the second term is zero since di(x) = E[JQ(t + 1) | X(t) = x] — x. □ 

Observation 3.4. var[Xi(t + 1) | X(t) = x] < ± £ <ei4(x) ~ + S £jes«(«0 ~ ^j)- 
Proo/. 

n n 
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Definition: For any assignment x, let Si(x) = \{j \ Xj = x, r — 1}| and U(x) = \{j \ Xj = Xi + Let 

w i0) =Y,i=iY,j£[ n ]:\ Xi -x i \>i\ x i - x j\ and "afc) =T,7=i( s i( x )- l i{ x )f- Let u(a;) = ui(x)/n + u 2 (a;)/^ 2 - 
We will show that u(x) is on upper bound on the expected potential after one step, starting from state x. 
The quantity u\ (x) corresponds to the contribution arising from the sum of the variances of the individual 
loads and 1(2(2) corresponds to the rest. 

Observation 3.5. E[$(X(i + l)) | X(t) =x]< u(x). 

Proof. 

n 

E[$(X(t + l)) I X(t) = x] +nx 2 = ^TE[Xi(t + l) 2 | X(t) = x] 

i=i 

n n 

= ^(E[Xi(t+l) I X(t) = x]) 2 + ^var(X l (i + l) | X{t) = x). 

i=l i=l 

Using Observations 13.31 and 13.41 this is at most nx 2 + Y17=i di{x) 2 + U\{x)/n. But 



di(x) = - V* (xi - Xj) = ~(si(x) - ti(x)), 



n 

j:\xi-Xj\Kl 



so the result follows. □ 

Lemma 3.6. E[<5>{X(t + 1)) | X(t) = x]<n + 2n 1 / 2 <S>(x) 1 / 2 . 

Proof. In the proof of Observation [331 we established that E[$(X(f + 1)) | X(t) = x] < £™ =1 di(x) 2 
Ui(x)/n. Upper-bounding Ui(x) and using di(x) < 1, we have 



n n 

E[$(X(i + l)) \X(t)=x] <n+-^^|xi-:r j | 



n 

i=i 3=1 



and since |x, — xj\ < |xj — x\ + \xj — x\, this is at most n + 2^™ =1 |xj — x\. By Cauchy-Schwarz, 



(Ei " «l ' l ? < \ x i ~ x \ 2 E t 1 so 
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Corollary 3.7. E[$pf(t + 1))] < n + 2n 1 / 2 (E[<I>(A(i))]) 1/2 . 

Proof. Using Lemma 13.61 E[<&(X(t + 1))] < n + 2n 1 / 2 E[/ 1 / 2 ] where / denotes the random variable 
®{X(t)). By Jensen's inequality E[/ x / 2 ] < (E[/]) 1/2 since the square-root function is concave, so we get 
E[<5>(X(t + 1))] < n + 2n 1 / 2 {E[f]) 1/2 . Q 

Lemma 3.8. Either there is at' <t s.t. E[$(X(f))] < 18n orE[$(X(*))] < tf-- 2 '* n 1 -- 2 '* ^{X^)) 2 '* . 

Proof. The proof is by induction on t. The base case is t = 0. For the inductive step, note that 
1 - 2"* = £fe=i 2~ fe . Suppose that for all i' < t, E[$(X(t'))] > I8n (otherwise we are finished). Then by 
Corollary 13. 71 

E[*(Jr(i))] = n + 2n 1 ' 2 (E[<S>(X(t - 1))]) 1/2 < 3n 1/2 (E[<I>(A(t - 1))]) 1/2 . 
Applying the inductive hypothesis, 

m(X(t))} < 3n 1 / 2 (3 2 ( 1 - 2_(t " ) )n 1 - 2_(t " ) $(X(0)) 2_(t " > ) 1/2 . 

□ 

Corollary 3.9. There is ar < |~lglg$(X(0))] sucft that E[$(X(r))] < 18n. 

Proo/. Take < = |Tglg$(X(0))~|. Either there is a r < t with E[$(X(r))] < 18n or, by the lemma, 

E[®(X(t))] < 9n$(X(0)) 2_t < 18n. 

□ 

Corollary 3.10. There is a r < [lglg $(X(0))] such that Pr($(X(r)) > 720n) < 1/40. 

Proof. Consider the (non-negative) random variable Y = $(X(r)) where r is the quantity from Corol- 
lary [SHI Markov's inequality says that for any a > 0, Pr(F > a) < E[Y]/a. Now use Corollary 13.91 with 
a = 720n. □ 

Corollary 3.11. For all e > 0, provided that n < m 1 / 3 , the expected time to reach e-Nash equilibrium 
is O (log log m). 

Proof. Since the bound is asymptotic as a function of m for fixed e, we can assume without loss of 
generality that m > (60/e) 2 and that em/(2n) is an integer. We show that for any starting assignment X(0), 
there exists r < loglog(m 2 ) such that Pr(X(r) is e— Nash) > |§. This implies the statement of the result 
since the number of blocks of t steps needed to reach an e-Nash equilibrium is at most 




Suppose assignment x is not e-Nash. If X(t) = x there exist resources i,j with Xi(t) — Xj(t) > em/n. 
We use the following notation. Let A = em/(2n). Let (3 = Xi(t) - Xj(t) - 2A. Note (3 > 0. If X(t + 1) is 
obtained from X(t) by transferring A tasks from i to j, then 

&(X(t)) - $(X{t + 1)) = X t (t) 2 + X^tf - Xi(t + l) 2 - Xj(t + l) 2 

= (2A + (3 + X 3 {t)) 2 + X 3 {t) 2 -(A + P + Xjit)) 2 - (A + X (t)) 2 
= 2A(A + f3 + Xj(t)) + A 2 - (2AX j (t) + A 2 ) 
= 2A(A + (3) > A 2 = {em/2n) 2 . 

It follows that $(X(t)) > (em/2n) 2 . 

From Corollary [3TT01 Pr($(X(r)) < 720n) > |§, for r = loglog($(0)) = O (log log m). 

An assignment A"(r) with $(X(r)) < 720n must be e-Nash if (em/2n) 2 > 720n. Note that m > n 3 and 
m > (60/e) 2 . Hence, from e 2 (60/e) 2 n 3 > 4.720.n 3 , we can deduce e 2 m 2 > 4.720.n 3 , hence (em/2n) 2 > 720n. 
□ 

Corollary 13.101 tells us that $(A(r)) is likely to be 0(n). We want to show that <§>(X(t)) quickly gets 
even smaller (all the way to a Nash equilibrium) and to this end, we show that <&(X(t)) is a super-martingale. 
By Observation 13. 5\ it suffices to show u(x) < $(x), and we proceed with this. In the following, we shall 
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consider the cases \xi — x\< 2.5 for all i € [n] (Lemma I3.12p and 3i G [n] : \xt — a;| > 2.5 (Lemma 13. 13[) 
separately. 

Lemma 3.12. Suppose that assignment x = (x\, . . . ,x n ) satisfies \xi — x\< 2.5 for all i G [n]. Then 
u(x) < $(x). 

Proof. For all i G [n] and j G [n] we have \xi — Xj\ < \xi — x\ + \xj — x\ < 5. Let z = min^ Xi so every 
Xi G {z, . . . , z + 4}. Let ni = \{j | Xj = z + i}\. Then 

i=l Vi=l / \i=0 / \j=0 

Also, n 2 u(x) = nui(x) + U2(x), where 

ui(x) = n (2n 2 + 3n 3 + 4n 4 ) + n 1 (2n 3 + 3n 4 ) + n 2 (2n + 2n 4 ) + n 3 (3n + 2ni) + n 4 (4n + 3ni + 2n 2 ) 
and 

u 2 {x) = n nl + rii(n - n 2 ) 2 + n 2 (ni - n 3 ) 2 + n 3 (n 2 - n A ) 2 + mn\. 

Plugging in these expressions and simplifying, we get 

n 2 $>(x) — n 2 u(x) =4nonin 2 + 3no«3 + 4no^i"-3 + ^n-on 2 n3 + Anin 2 n^ + 3non3 + 8t7q7J4 + 12non\n4 

+ 371^714 + 8no7i2n4 + Ani^n^ + 12no7i37i4 + Aninsn^ + An 2 n^n4 + &non 2 + 3nin 2 , 

which is clearly non-negative since all coefficients are positive. □ 

Lemma 3.13. Suppose that assignment x — (x\ , . . . , x n ) satisfies \x n — x\ > 2.5 and, for all i G [n], 
\%i — x\ < \x n — x\. Let w — (w\, . . . ,w„_i) be the assignment with Wi — Xi for i G [n — 1]. Then 
<f>(x) — u(x) > $(kj) — u(w), that is, the lower bound on the potential drop for x is at least as big as that for 
w. 

Proof. Let k — \x n — x\. We will show 

(1) - $(w) > k 2 , and 

(2) u(x) - u{w) <2k+\. 
Then 

$0) - u{x) - - u{w)) > k 2 - (2k + 1), 

which is non- negative since k > 2.5 > 1 + v2. 

First, we prove (1). Let f(z) — Y^hZi ( x i ~ Z Y • Note that the derivative of f(z) is 

n-l 

f'(z) = 2(n - l)z - 2 Y Xi = 2(n - l)z - 2(n - l)w. 

Furthermore the second derivative is f"(z) = 2(n — 1) > 0. Thus, f(z) is minimized at z — w. Now note 
that 

n— 1 n— 1 

- = fc 2 + (xi - xf -^(xi- wf > k 2 . 

i=l i=l 

Now we finish the proof by proving (2). Assume first that x n — x + k. Then 

n n 

ui(x) - ui(w) = 2 \ x i - x n\ < 2^ \xi - x n \ = 2^2(x n - Xi) = 2nk. 

i£[n]:\xi — x n \>l i=l i=l 



Let Zj = \{£ \ xi — Clearly zj = for j > x n . Let £ = \x n — 2k~\ . For i G [n] we have xi > x — k = x n — 2k 
so z,j — for j < £. Now u 2 (x) = Ylj=£ z j( z j-i ~ z j+i) 2 - The representation of w in terms of ZjS is the 
same as the representation of x except that z Xn is reduced by one. Therefore, 



u 2 (x) - u 2 (w) = z Xn -i ({z Xn - 2 - z Xn f - (z Xn - 2 - z Xn + l) 2 ^j + +(z Xn ^ 

= z x n -i(-2z Xn - 2 + 2z Xn + z Xn -i - 1) < z Xn -i(2z Xn + z Xn -i). 



^2 

i — z x n +l) 



But since z Xn ^ Tt — z Xn —Xi the upper bound on the right-hand side is at most 

z Xn -i(2n - 2z Xn -i + 2 x „-i) = 2z Xn _i(n - z x „_i/2), 

which is at most n 2 since the right-hand side is maximized at z Xn ^i — n. To finish the proof of (2), use the 
definition of u to deduce that 

uip) - ui(w) m 2 (X) -m 2 (w) 

u[x) — u(w) < 1 - . 

n n z 

The proof of (2) when x n = x — k is similar. □ 

Corollary 3.14. For any assignment x = (xi, . . . ,x n ), $(cc) — u{x) > 0. 

Proof. The proof is by induction on n. The base case, n = 1, follows from Lemma T3. 121 Suppose n > 1. 
Neither $(a;) nor depends upon the order of the components in x, so assume without loss of generality 
that \xi — x\ < \x n — x\ for all i. If \x n — x\ < 2.5 then apply Lemma 13.121 Otherwise, use Lemma 13.131 to 
find an assignment w = (w\, . . . ,w n -i) such that — u(x) > &(w) — u(w). By the inductive hypothesis, 
- u(w) > 0. □ 

Together, Observation [33] and Corollary [331 tell us that E[$(X(t + 1)) | X(t) = x] < $(.t). The next 
lemma will be used to give a lower bound on the variance of the process. Let V = 0An~ 2 . 
Lemma 3.15. Suppose that X(t) = x and that x is not a Nash equilibrium. Then 

Pr($(X(t + 1)) ^ <P(x) | X{t) = x) > V. 



Proof. Choose s and £ such that for all i £ [n], x s < Xi < xg. Since x is not a Nash equilibrium, 
xi > x s + 1. Assuming X(t) — x, consider the following experiment for choosing X(t + 1). 

The intuition behind the experiment is as follows. We wish to show that the transition from X(t) to 
X{t+ 1) has some variance in the sense that Q(X(t + 1)) is sufficiently likely to differ from &(X(t)). To do 
this, we single out a "least loaded" resource s and a "most loaded" resource I as above. In the transition 
from X(t) to X(t + 1) we make transitions from resources other than resource £ in the usual way. We pay 
special attention to transitions from resource £ (and particular attention to transitions from resource £ which 
could either go to resource s or stay at resource £) . It helps to be very precise about how the random decisions 
involving tasks that start at resource £ are made. In particular, for each task b that starts at resource £, we 
first make a decision about whether b would accept the transition from resource £ to resource s if b happened 
to choose resource s. Then we make the decision about which resource task b should choose. Of course, we 
can't cheat and we have to sample from the original required distribution. Here are the details. 

Independently, for every i ^ £, choose (li,i(a;), . . . , Yi_ n (x)) from the multinomial distribution described 
in Section[2j (In the informal description above, this corresponds to making transitions from resources other 
than resource £ in the usual way.) Now, for every task b € X£, let Zb = 1 with probability 1 — x s /xt and 
Zb = otherwise. (In the informal description above, this corresponds to deciding whether b would accept 
the transition to s if resource s were (later) chosen.) Let x\ be the number of tasks b with Zb = 1 and let 
xj be the number of tasks b with Zb = 0. Choose (Y f + 1 (x), . . . , Y^^x)) from a multinomial distribution with 
the constraint Yll—i Y e + j(x) = x\ and probabilities given by 



l 



if 3 = s , 



+ f x \ = j H 1 ^ if 3 ^ 3 and Xt > Xj + lj 

' ' ' ' Q if j but x e < Xj + 1, 

Similarly, choose (Yf^x), . . . , Yp~ n (x)) from a multinomial distribution with the constraint Y^j=i Y/~j{x) 
and probabilities given by 



PeA x ) 



- / n 



ifj' = s, 

- % ) if j ^ s and x t > Xj + 1, 

if I 7^ j but xi < Xj < + I, 

1 -J2j=iePi,j( x ) if^ = J- 
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For all j, let Ygj(x) = Y^-{x) + Y t Ax). Informally, the pj • transition probabilities are set up so that 
packets which decided that they would accept a transition to s behave appropriately and the pjj transition 
probabilities are set up so that packets which decided that they would not accept a transition to s behave 
appropriately. By combining the probabilities, we see that X(t + 1) is chosen from the correct distribution 
in this way. 

Now, consider the transition from x to X(t + 1). Condition on the choice for (Yi^(x), . . . , Yi_ n (x)) for 
all i 7^ I. Suppose x\ > 2. Condition on the choice for (Yf^x), . . . , Yf n (x)). Flip a coin for each of the first 

x£ — 2 tasks with Zf, = 1 to determine which of Y^(x), . . . , Y^ (x) the task contributes to. Condition on 
these choices. Consider the following options: 

(1) Let x\ be the resulting value of X(t + 1) when we add both of the last two tasks to Y^ g (x). 

(2) Let xi be the resulting value of X(t + 1) when we add one of the last two tasks to Y^~ g (x) and the other 
to Y+(x). 

(3) Let X3 be the resulting value of X(t + 1) when we add both of the last two tasks to Y+ S (x). 

Note that, given the conditioning, each of these choices occurs with probability at least ri~ 2 . Also, ^(xi), 
$(x 2 ) and $(x 3 ) are not all the same. Thus, Pr($(X(t + 1) ^ $(x) | X(t) = x,xf > 2) > rT 2 . Also, 

\ Xp / \ / \ Xe — 1 

rp \ / \ / -T" 

S \ I \ I 



Pr(x+ > 2) = 1 - - 1-- - 

\x e J \ x t ) \x t 

Since the derivative with respect to x s is negative, this is minimized by taking x s as large as possible, namely 

xt - 2, so Vx(x\ > 2) > 1 — 7e~ 2 > 0.4, and the result follows. □ 

In order to finish our proof of convergence, we need the following observation about 

Observation 3.16. For any assignment x, <$>(x) < to 2 . Let r — to mod n. Then $(a;) > r(l — r/ri), 

with equality if and only if x is a Nash equilibrium. 

Proof. Suppose that in assignment x there are resources i and j such that x% — xj > 2. Let x be the 

assignment constructed from x by transferring a task from resource i to resource j. Then 

- §{x') = x\ ~ xf + x) - xf = xj - {x\ - 2x t + 1) + x) - (x) + 2xj + 1) 
= 2x t - 2xj - 2 = 2{xi - Xj) - 2 > 0. 

Now suppose that, in some assignment x', resources i and j satisfy x\ > x'j > 0. Let x be the assignment 
constructed from x' by transferring a task from resource j to resource i. Since (a^ + 1) — (x'j — 1) > 2, the 
above argument gives > $(x'). We conclude that an assignment x with maximum $(a;) must have all 
of the tasks in the same resource, with <f>(x) = to 2 . 

Furthermore, an assignment x with minimum $(x) must have \xi — Xj\ < 1 for all In this case there 
must be r resources with loads of q + 1 and n — r resources with loads of q, where to = qn + r. So 

= r(q + 1 - x) 2 + (n - r)(g - xf = r(l - +(n-r)^-^ = r ^1 - . 

Note that x is a Nash assignment if and only if \xi — Xj\ < 1 for all i and j. □ 

Combining Observation l3.16l and Corollarv l3 . 1 01 we find that there is a r < [lglgm 2 ] such that Pr($(X(r)) > 

720n) < 1/40. Let B = 7200n + [s^-l - ^. Let <' = t + [10S 2 /y] . 

Lemma 3.17. Given any starting state X(Q) — x, the probability that X(t') is a Nash equilibrium is at 
least 3/4. 

Proof. The proof is based on a standard martingale argument, see [20]. Suppose that $(A(r)) < 720n. 
Let W t = ®(X(t+T))-r(l-r/ri) and let D t = min(W t , B). Note that D < 720n. Together, Observation^ 
and Corollary 13 . 141 tell us that Wt is a supermartingale. This implies that D t is also a supermartingale since 

E[A+i \D t = x<B]< E[W t+1 \W t =x<B}<Wt = D t , 

and 

E[A+i I D t = B] < B = D t . 
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Together, Lemma OH and Observation HSH tell us that if x > 0, Pr(W t +i 7^ W { | W t = x) > V. Thus, if 

< x < B, 

Pr(A+i 7^ A I A = a) = Pr(min(W t+ i, B) £W t \W t =x) 

> Pr(W i+1 ^ W t A D ^ W t I W t = a?) = Pr(W t+ i ?W t \W t =x)>V. 

Since D t+ i — D t is an integer, E[(A+i — D t ) 2 | < D t < B] > V. Let T be the first time at which 
either (a) D t = (i.e., X(t + r) is a Nash equilibrium), or (b) Dt = D. Note that T is a stopping time. 
Define Zt = (B — Dt) 2 — Vt, and observe that ZtKT is a sub-martingale, where t AT denotes the minimum 
of t and T. Let p be the probability that (a) occurs. By the optional stopping theorem E[Dt] < Do, so 
(1 — p)B = E[Ar] < Do and p > 1 — Dq/B > yjj. Also, by the optional stopping theorem 

pD 2 - VE[T] = E[(D - D T ) 2 } - VE[T] = E[Z T ] > Z = (B - D Q ) 2 > 0, 

so E[T] < pB 2 /V. Conditioning on (a) occurring, it follows that E[T \ D T = 0] < B 2 /V. Hence Pr(T > 
10B 2 /V I Dt = 0) < jq. So, if we now run for 10B 2 /V steps, then the probability that we do not reach a 
Nash equilibrium is at most ^ + 2 • i < 1/4. 
Now we can give the proof of Theorem 13.11 

Proof. Subdivide time into intervals of t' steps. The probability that the process has not reached a Nash 
equilibrium before the (j + l)st interval is at most (1/4) 3 . □ 

4. Lower Bounds. In this section we prove the lower-bound results stated in the introduction. We 
will use the following Chernoff bound which can be found, for example, in [16]. Let N > 1 and let pi £ [0, 1] 
for i — l,...,N. Let Xt, X2, ■ ■ ■ , Xm be independent Bernoulli random variables with Pr(X^ = 1) = pi for 

1 = 1, . . . , N and let X = X 1 H h X N . Then we have ELY] = Y,?=i Pi and for < e < 1, 

Pr(X < (1 - e) ■ E[X}) < exp (- - '■ f X] ^ . (4.1) 

The following theorem gives an exponential lower bound for the expected convergence time of the process 
in Figure ITTTI 

Theorem 4.1. Let X{t) be the process in Figure [7771 with m — n. Let X(0) be the assignment given by 
X(0) = (n, 0, . . . ,0). Let T be the first time at which X(t) is a Nash equilibrium. Then E[T] = exp(0(y / n)). 

Proof. For an assignment x, let no(x) denote the number of resources i with Xi — 0. Thus, no(X(0)) = 
n — 1. The (unique) Nash equilibrium x assigns one task to each resource, so uq(x) = 0. Let k — [v^J- We 
will show that for any assignment x with no(x) > k, 

Pr(n (X(t)) < k I X(t - 1) = x) < exp(-6(Vn)). 

This implies the result. 

Suppose X(t— 1) = x with uq(x) > k. For convenience, let uq denote hq(x). Let x' denote X(t), and let 
n' denote no(x'). We will show that, with probability at least 1 — exp(— 0(y / n)), n' Q > k. During the course 
of the proof, we will assume, where necessary, that n is sufficiently large. This is without loss of generality 
given the notation in the statement of the result. 

Case 1. n > 8k. 

Consider the protocol in Figure [TTT1 Let U = {b \ Xj b = 0}. E[|[/|] = n , so by the Chernoff bound 
(Equation 6U), Pr(|[/| < \ f] + [^]) < Pi(\U\ < |n ) = exp(-6(^)) . Thus, \U\ > \n /2] + \3n /8~] 
with probability at least 1 — exp(— (^(y 7 ^)). Suppose this is the case. Partition U into U\ and U2 with 
\Ui\ = \n /2~\. Let W = UbeUiijb}- First, suppose |W| < |n . In that case 

3 3 

\{j I x'j > 0}| < n - \Ux\ + -n = n - [n /2] + -n < n - k, 

so n' Q > k. Otherwise, let U' = {b G U 2 | jh G W}. 

IWI 9 9 
E M = l^lU > -no > 

so by the Chernoff bound gUJ, Pr(|Z7'| < Jfe) = Pr(|C/'| < (1 - i)E[|t/'|]) = exp (-0(Vn)), recalling that 
= Lv^J- Thus \U'\ > k with probability at least 1 — exp(— 0(->/n)), which implies n > fe. 
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Case 2. k < n < Sk. 

Consider the protocol in Figure ITTTl Let L be the set of "loners" defined by L — {i \ x.- L — 1} and let 
I = \L\. The number of resources i with xi > 1 is n — hq — £ and this is at most half as many as the number 
of tasks assigned to such resources (which is n — £), so I > n — 2n$. Let U = {b \ i b <G L and Xj b = 0}. 
E[|[/|] = £2* > (ra ~ 2 " o)Ko = 6(Vn), so by the Chernoff bound l|0]l . Pr(\U\ < 2\{£?f]) < Pr([f7| < 
|E[|C/|]) < exp(-6(v^)). Thus, \U\ > 2["^2a] with probability at least 1 - exp(-Q(y/n)). Suppose this 
is the case. Let Ui and U2 be disjoint subsets of U of size T^ 1 ^ 1 ]- Order tasks in U arbitrarily and let 
S = {b £ U | for some V <E U with b' < b, jy = jb-}- (Note that |5| does not depend on the ordering.) Let 
W = U beUl {j b }. 

Note that if \W\ < \£^f then \S\ > ±£^ > f§(^) 2 . Otherwise, let U' = {b € U 2 \ j b G W}. 

no 20 \n 



so, by the Chernoff bound gT]), Pr(\U'\ < \^ (f ) ) < exp (-0(V»i)) (recall that n (|) > n > 

Jb (^^P) 2 = &{Vn)), and thus |C/'| > ^(f) 2 with probability at least 1 - exp (-6 (^/n)), so |5| > ^(f) 2 - 

Suppose then that |5| > f§ (-) ■ Assuming that n is sufficiently large, \S\ > fc/41. Let _B = U(, e [/{j; ) } 
and B\ = yJbeL-u{ib] ■ Note that every resource in Bq U B\ is used in a;' for some task b £ L. Thus, 
|-BoUBi| < £-\S\. Let R = {i\x i =0}UL-B -B 1 . Then \R\ > n +£- (£- \S\) > n + \S\ > (1 + ^)A;. 

Let T = {b | i b g L,j b e R}. E[T] = (n - £)^ and 



10 °y " V ^ y V n ) ~ \ n J 

so with probability at least 1 — exp(— 6( v / n)), T < |i?|/100. In that case, n' > \R\(1 — j^) > k. □ 

The following theorem provides a lower bound on the expected convergence time regardless of which of 
the two protocols is being used. 

Theorem 4.2. Suppose that m is even. Let X(t) be the process in Figure [2~1\ with n — 2. Let X(0) be 
the assignment given by X(0) = (m, 0). Let T be the first time at which X(t) is a Nash equilibrium. Then 
E[T] = ri(loglogm). The same result holds for the process in Figure ["PI 

Proof. Note that both protocols have the same behaviour since m is even and, therefore, the situation 
x\ — X2 + 1 cannot arise. For concreteness, focus on the protocol in Figure [2~T1 

Let y{x) — maxiXi — m/2 and let yt — y{X{t)) so yo = to/2 and, for a Nash equilibrium x, y{x) = 0. 
We will show that for any assignment x, Pr(y t+ i > y{x) l l w \ X(t) = x) > 1 — y t 1 ^ 4 . (There is nothing very 
special about the exact value "1/10" - this value is being used as part of an explicit "lack of concentration" 
inequality in the proof, noting that for a lower bound we essentially want to lower-bound the variances of 
the load distributions. This seems to require a somewhat ad-hoc approach, in contrast with the usage of 
concentration inequalities.) 

Suppose X{t) = x is an assignment with x± > x<i. As we have seen in Section^ Y\p[x) (the number of 
migrations from resource 1 to resource 2 in the round) is a binomial random variable 

B(^l(l- X A)=B(Z + y t , 2yt 



2 V x iJ J V 2 m + 2y-, 

In general, let T t be the number of migrations from the most-loaded resource in X(t) to the least-loaded 
resource and note that the distribution of T t is B + y t , m+2y t ) w ^ n mean y t . If T t = y t + £ or T t = yt — £ 

then yt+i = £■ Thus Pr(y t+ i > y\^ W ) = Pr(|Xi — E[T t ]| > y^ 10 ). We continue by showing that this binomial 
distribution is sufficiently "spread out" in the region of its mode, that we can find an upper bound on 
Pr(y f+ i < y] /W ). This will lead to our lower bound on the expected time for (yt)t to decrease below some 
constant (we use the constant 16). 
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Pr(T t = y t ) 



Pr(T t = yt + j) = 



2?n + y t 

|m + j/t 
j/ t +j 7 \m + 2y 



m + 2y t 

2yt 



yt 



( ~*Vt \ yt+j / to n 
Vm + 2wJ Vm + 2wJ 



TO 



to + 2y 4 
m 



m + 2y t , 



Suppose j > 0. 



Pr(T t =y t +j) 
Pr(T t = y t ) 



/ 2y t y/ to 
Vm + 2«J Vto + 2wJ V 



m + 2y t J \m + 2y t J \(y t + j)\(\m + y t - (y t + j)) 



3 1 



S)'(ir 



TO + 1 - £ 



/2y t y 



! yt + £ J \mJ 2y t + 2£ 



)(n 



+ 2- 2£ 



> 



/2y t y 7-rr to - 2 j \ = r/2y t \ / to - 2 j \ 



2j/t + 2j 



to / \2j/ t + 2j, 



Similarly, for j < 0, 



Pr(T t =yt+j) 
Pr(T t = y t ) 



m + £ 



1 2 



/ to \ U 
V2^. 



n 

i=i 



2y t + 2- 2£\ 
i + 2l ) 



> 



/ to \bl / 2y t - 2jj| \|j| = r/mN/ 2y t -2|jK 
V2yJ Vto + 2|?|J LUwJ V to + 2\i\ J 



m + 2\j\ 



2y t J\ m + 2\j\ 



\j\ 



(iyt \ / to 2j \ 

V m)\2yt + 2j) 



So for all j, 



Pr(T t = ^ + j) 
Pr(T t = yt) 



> 



/2yt\ / to - 2j \y = r/ ^ \/ m-2j \ 



So, for all j with |j| < y 1 ^ 4 , where y 1 ^ 4 is the positive fourth root of y t , this is at least 



1/4 „ 1/4 1/4 

/ Ift y* f m-2y t / Y* 
L+^J I to J 



> 



v t 1/4 /2yt-2y t 1/4 y t 1/4 



v yt + y t 



2j/t 



) "&) 



Y'V'" _(y, + yl"-iyl l, y' 



Vt + Vt 



Vt + Vt 



1/4 



= 1- 



2y. 



1/4 \»t 



1/4 



Vt + Vt 



1/4 



> 1 



2y. 



1/4 n yt 



1/4 



2/t 



3/4^ 



1/4 



> 1 - 2y;^yl fi = 1 - 2yr 1/2 > J 



where the last inequality just requires y t > 16. 

Note that the mode of a binomial distribution is one or both of the integers closest to the expectation, 
and the distribution is monotonically decreasing as you move away from the mode. But, for \j\ < y\^, 
Pr(T t = y t + j) > \ Pr(T t = y t ), hence Pr(T t = y t ) < 2/(1 + 2y t 1/4 ). Since Pr(T t = y t + j) < Pr(T t - y t ), it 
follows that 

Pr(T t G - y t 1/10 , yt + yl /W }) < (2y t 1/10 + 1) Pr(T t = y t ) < 3yr 3/2 °. 
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We say that the transition from y t to yt+i is a "fast round" if yt+i < Ut/ 10 (equivalently, it is a fast 
round if T t S [yt — 2/^ 10 , Ut + Vt^^])- Otherwise it is a slow round. Recall that j/o = tti/2. Let 



log 



10 



log(12 20 / 3 ) 



If the first j rounds are slow then yj > y^° 3 . If j < r then j/q° 3 > 12 20 / 3 so the probability that the 

transition from yj to yj+i is the first fast round is at most 3(yQ° j < 1/4. 

Also, if j < r then these probabilities increase geometrically so that the ratio of the probability that the 
transition to j/j+i is the first fast round and the probability that the transition to yj is the first fast round is 



3 



[Vo ) 



-3/20 

, 3 / 20 . / 1n -( J + i)\3/ 20 



3(2/0- 



10 



-3/20 



so J2 T jZo Pr(transition from yj to y 3+1 is the first fast round) < 2 • 1/4 = |. Therefore, with probability at 
least 1/2, all of the first r rounds are slow. In this case, argmin t (y t < 16) = £l(loglog(m)), which proves the 
theorem. □ 

We also have the following observation. 

Observation 4.3. Let X(t) be the process in Figure [Ql with m = n. Let X(0) be the assignment given 
by X(0) = (2, 0, 1, . . . , 1). Let T be the first time at which X(t) is a Nash equilibrium. Then E[T] = Q(n). 

The observation follows from the fact that the state does not change until one of the two tasks assigned 
to the first resource chooses the second resource. 

5. Summary. We have analyzed a very simple, strongly distributed rerouting protocol for m tasks on n 
resources. We have proved an upper bound of (loglogm+n ) on the expected convergence time (convergence 
to a Nash equilibrium), and for m > n 3 an upper bound of O (log log m) on the time to reach an approximate 
Nash equilibrium. Our lower bound of f2(loglogm + n) matches the upper bound as function of m. We have 
also shown an exponential lower bound on the convergence time for a related protocol that allows "neutral 
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